Econometrics at Scale: Spark up Big Data in Economics
نویسندگان
چکیده
This paper provides an overview of how to use “big data” for social science research (with emphasis on economics and finance). We investigate the performance ease different Spark applications running a distributed file system enable handling analysis data sets which were previously not usable due their size. More specifically, we explain (i) explore big exceed retail grade computers memory size (ii) run typical statistical/econometric tasks including cross sectional, panel time series regression models are prohibitively expensive evaluate stand-alone machines. By bridging gap between abstract concept ready-to-use examples can easily be altered suite researchers need, provide economists scientists more generally with theory practice handle ever growing datasets available. The reproducing in this makes guide useful reference limited background computing.
منابع مشابه
Conquering Big Data with Spark
Today, big and small organizations alike collect huge amounts of data, and they do so with one goal in mind: extract "value" through sophisticated exploratory analysis, and use it as the basis to make decisions as varied as personalized treatment and ad targeting. To address this challenge, we have developed Berkeley Data Analytics Stack (BDAS), an open source data analytics stack for big data ...
متن کاملBig Data in Economics
The success of modern economics results to a large extent from the availability of data sources which allowed us to quantify human behavior, from individual purchases in supermarkets to the degree of interconnectedness between global financial markets. As such many economists are already comfortable working with large datasets, from financial transactions to census datasets. Without new analyti...
متن کاملBig Data: New Tricks for Econometrics
Nowadays computers are in the middle of most economic transactions. These “computer-mediated transactions” generate huge amounts of data, and new tools can be used to manipulate and analyze this data. This essay offers a brief introduction to some of these tools and methods. Computers are now involved in many economic transactions and can capture data associated with these transactions, which c...
متن کاملSpark-BDD: Debugging Big Data Applications
Apache Spark has become a key platform for Big Data Analytics, yet it lacks complete support for debugging analytics programs. As a result, the development of a new analytical toolkit can be a painstakingly long process [7, 2, 4]. To fill this gap, we are developing Spark-BDD (Big Data Debugger), which brings a traditional interactive debugger experience to the Spark platform. Analytic programm...
متن کاملAdvances in Economics and Econometrics
This is the third of three volumes containing edited versions of papers and a commentary presented at invited symposium sessions of the Ninth World Congress of the Econometric Society, held in London in August 2005. The papers summarize and interpret key developments, and they discuss future directions for a wide variety of topics in economics and econometrics. The papers cover both theory and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of data science
سال: 2022
ISSN: ['1680-743X', '1683-8602']
DOI: https://doi.org/10.6339/22-jds1035